
model parallelism



Piper: Multidimensional Planner for DNN Parallelization

Neural Information Processing Systems

In the "modern era", such model-parallel training techniques trace their roots back to AlexNet [14] and early influential systems such as DistBelief [6] and Project Adam [3].




The rapid increase in sizes of state-of-the-art DNN models, and consequently the increase in the compute and memory requirements of model training, has led to the development of many execution schemes such as data parallelism, pipeline model parallelism, tensor (intra-layer) model parallelism, and various memory-saving optimizations. However, no prior work has tackled the highly complex problem of optimally partitioning the DNN computation graph across many accelerators while combining all these parallelism modes and optimizations. In this work, we introduce Piper, an efficient optimization algorithm for this problem that is based on a two-level dynamic programming approach. Our two-level approach is driven by the insight that being given tensor-parallelization techniques for individual layers (e.g., Megatron-LM's splits for transformer layers) significantly reduces the search space and makes the global problem tractable, compared to considering tensor-parallel configurations for the entire DNN operator graph.
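To make the two-level idea concrete, here is a minimal sketch assuming a linear chain of layers with made-up per-layer costs, a small set of candidate tensor-parallel degrees, and a crude cost model; none of this is Piper's actual cost model or implementation. The outer dynamic program chooses contiguous pipeline stages, and the inner choice picks a tensor-parallel degree for each stage under a shared device budget, minimizing the slowest (bottleneck) stage.

```python
# Hedged sketch of a two-level dynamic program for partitioning a linear chain
# of layers into pipeline stages, choosing a tensor-parallel degree per stage.
# Layer costs, TP options, and the device budget are illustrative placeholders.

from functools import lru_cache

layer_cost = [4.0, 8.0, 8.0, 4.0, 2.0]   # hypothetical per-layer compute cost at TP=1
tp_options = [1, 2, 4]                   # candidate tensor-parallel degrees per stage
num_devices = 8                          # total accelerator budget

def stage_cost(lo, hi, tp):
    """Cost of running layers lo..hi-1 as one stage at tensor-parallel degree tp.
    Crude model: compute scales ~1/tp plus a fixed communication penalty."""
    compute = sum(layer_cost[lo:hi]) / tp
    comm = 0.5 * (tp - 1)
    return compute + comm

@lru_cache(maxsize=None)
def best(lo, devices_left):
    """Minimum bottleneck stage time for layers lo.. given 'devices_left' devices.
    Pipeline throughput is limited by the slowest stage, hence the max()."""
    if lo == len(layer_cost):
        return 0.0
    result = float("inf")
    for hi in range(lo + 1, len(layer_cost) + 1):    # outer DP: next stage boundary
        for tp in tp_options:                        # inner choice: TP degree for stage
            if tp <= devices_left:
                cand = max(stage_cost(lo, hi, tp), best(hi, devices_left - tp))
                result = min(result, cand)
    return result

if __name__ == "__main__":
    print(f"best bottleneck stage time: {best(0, num_devices):.2f}")
```

The real problem Piper addresses also folds in the memory-saving optimizations mentioned in the abstract; the sketch only illustrates how the two decision levels nest.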


ASAP: an Agentic Solution to Auto-optimize Performance of Large-Scale LLM Training

Yuran Ding, Xinwei Chen, Xiaofan Zhang, Zongwei Zhou

arXiv.org Artificial Intelligence

Optimizing large-language model (LLM) training on distributed domain-specific accelerator systems presents significant challenges due to its complex optimization space. Existing optimization methods, however, rely on time-consuming manual tuning or resource-intensive black-box searches, which struggle to keep pace with the rapidly evolving LLM domain, leading to slow development and underutilized resources. To address this, we introduce ASAP, an Agentic Solution to Auto-optimize Performance of Large-Scale LLM Training. It is a multi-agent system, featuring Coordinator, Analyzer, and Proposal agents, which integrates LLM reasoning with insights from performance profiling tools, roofline analysis, and a knowledge base of best practices and successful past optimizations from human experts. Our proposed design can automate the diagnosis of performance bottlenecks and recommend optimized sharding configurations with reasoning, thus effectively improving the efficiency of distributed LLM training. Experiments have shown that the ASAP-generated sharding configurations can contribute up to 28% training step time reduction and 1.43 times throughput improvement. When combined with additional optimization from human experts, throughput can be further increased to 2.58 times. The proposed ASAP promises to provide a scalable and explainable methodology for AI-assisted performance engineering in large-scale LLM training.
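As a rough illustration of how such a loop could be wired together, the sketch below hard-codes a Coordinator/Analyzer/Proposal cycle over a toy profile and sharding configuration. The dataclasses, thresholds, and heuristics are hypothetical stand-ins for the paper's LLM-driven agents, profiling tools, roofline analysis, and knowledge base, not ASAP's actual interfaces.

```python
# Hedged sketch of a coordinator/analyzer/proposal loop in the spirit of ASAP.
# All names, fields, and heuristics below are illustrative placeholders.

from dataclasses import dataclass

@dataclass
class Profile:
    step_time_s: float      # measured training step time
    mfu: float              # model FLOPs utilization (0..1)
    comm_fraction: float    # share of step time spent in collectives

@dataclass
class ShardingConfig:
    data_parallel: int
    tensor_parallel: int
    pipeline_parallel: int

def analyzer(profile: Profile) -> str:
    """Turn profiling numbers into a coarse bottleneck diagnosis
    (stand-in for LLM reasoning over profiler and roofline output)."""
    if profile.comm_fraction > 0.3:
        return "communication-bound"
    if profile.mfu < 0.35:
        return "compute-underutilized"
    return "balanced"

def proposal(diagnosis: str, cfg: ShardingConfig) -> ShardingConfig:
    """Suggest a new sharding configuration for the diagnosed bottleneck
    (stand-in for retrieving best practices from a knowledge base)."""
    if diagnosis == "communication-bound" and cfg.tensor_parallel > 1:
        return ShardingConfig(cfg.data_parallel * 2,
                              cfg.tensor_parallel // 2,
                              cfg.pipeline_parallel)
    if diagnosis == "compute-underutilized":
        return ShardingConfig(cfg.data_parallel,
                              cfg.tensor_parallel,
                              max(1, cfg.pipeline_parallel // 2))
    return cfg

def coordinator(profile: Profile, cfg: ShardingConfig) -> ShardingConfig:
    """Coordinator: route the profile through analysis, then request a proposal."""
    return proposal(analyzer(profile), cfg)

if __name__ == "__main__":
    before = ShardingConfig(data_parallel=8, tensor_parallel=8, pipeline_parallel=4)
    prof = Profile(step_time_s=12.4, mfu=0.31, comm_fraction=0.38)
    print(coordinator(prof, before))
```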


Supplementary material of LoCo: Local Contrastive Representation Learning

Neural Information Processing Systems

In this section we show the block structure of each stage in Progressive ResNet-50 in Table 1. The results are shown in Figure 2; we can see that LoCo learns image embeddings. Last, we show qualitative results of detection and instance segmentation tasks on COCO in Figure 1.


ffe10334251de1dc98339d99ae4743ba-AuthorFeedback.pdf

Neural Information Processing Systems

We thank the reviewers for their thoughtful comments. But consider the case of training BERT on a TPU pod, which takes around 4 days. We provide a formalization of the problem with rigorous guarantees. We now address a few of the specific reviewer concerns. However, in the revised version of this paper we will include a more thorough discussion of this. That post draws on Courcelle's theorem (namely, every graph property definable in the monadic second-order ...). We feel that it's more accurate to avoid ...
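For reference, the theorem alluded to in the truncated sentence above has the following standard statement (general background, not text recovered from the author feedback):

```latex
% Standard statement of Courcelle's theorem, included only for reference;
% this is not text from the author feedback itself.
\begin{theorem}[Courcelle]
  Let $\varphi$ be a graph property definable in monadic second-order logic
  and let $k \in \mathbb{N}$ be fixed. There is an algorithm that, given a
  graph $G$ of treewidth at most $k$, decides whether $G$ satisfies $\varphi$
  in time $O(|V(G)| + |E(G)|)$.
\end{theorem}
```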